From data warehousing to active information integration systems

Author

  • Mukesh K. Mohania
Abstract

Enterprises have gathered operational business information from multiple structured data sources and stored it in a central repository, called a data warehouse, for decision support and data analysis. Enterprises now recognize the need to integrate all of their information sources, including "unstructured" content, for deeper and richer analysis. Several applications require the integration of both structured and unstructured information based on events and business policies, such as processing warranty claims, finding promotional materials in real time based on a user's transaction value, and detecting health insurance claim fraud in (near) real time by integrating information from various data sources (some of which may belong to competitors). Thus, it is vital for data warehousing to enable the integration of data and content sources in order to provide real-time read and write access, to transform data for business analysis and data interchange, and to manage data placement for performance, currency, and availability. In this talk, we will first review existing technologies in data warehousing and information integration, and then discuss how enterprise applications are moving from data warehousing to (Active) Information Integration systems. We will also discuss the architecture of a new policy-based approach to integrating information that requires neither the definition of a global schema (the virtualization approach) nor any materialization of pre-computed results (the warehouse approach). Finally, we will discuss several applications that require this kind of integration and show that current approaches cannot satisfy them.

Similar resources

Toward Active XML Data Warehousing

Warehousing data is not a trivial task, particularly when dealing with huge amounts of distributed and heterogeneous data. Moreover, traditional decision support systems do not feature intelligent capabilities for integrating such complex data. Therefore, we propose an approach for intelligent decision support based on active XML warehousing. We exploit XML as a pivot language in order to unify...

Semantic Integration of Structured and Unstructured Data in Data Warehousing and Knowledge Management Systems

Nowadays, increasing information in enterprises demands new ways of searching and connecting the existing information systems. This chapter describes an approach for the integration of structured and unstructured data focusing on the application to Data Warehousing (DW) and Knowledge Management (KM). Semantic integration is used to improve the interoperability between two well-known and establi...

Database Research at UT Arlington (ITLab@CSE.UTA)

The Information Technology Laboratory (or ITLab) at the Computer Science and Engineering Department at The University of Texas at Arlington was established by Sharma Chakravarthy in Spring 2000. The mission of the ITLab is to conduct research and development on all aspects of information technology. Some of the topics currently being investigated are: Data Warehousing/Information Integration, D...

Active XML-based Web data integration

Today, the Web is the largest source of information worldwide. There is currently an increasing demand that decision-making applications such as Data Warehousing (DW) and Business Intelligence (BI) move onto the Web, especially in the cloud. Integrating data into the DW/BI applications is a critical and time-consuming task. To make better decisions in DW/BI applications, next generation data int...

Materializing Web Data

Business decisions must rely not only on company-internal data but also on external data from competitors or relevant events. This information can be obtained from the WWW but must be integrated with the data in a company's data warehouse. In this paper we discuss a system architecture for warehousing Web content for OLAP and DSS. A self-describing object model is used to make the implicit mode...

Publication date: 2007